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1 Ligation Method 

2 

3 Field of the Invention 

4 

5 This application relates to a method of ligating two 

6 or more molecules, for example, small organic 

7 molecules, labels, peptides etc. In particular it 

8 relates to a method of ligating a peptides, such as 

9 ligation of a synthetic peptide to a recombinant 
10 peptide. 

11 

12 Background to the Invention 
13 

14 Protein engineering methodologies have proven to be 

15 invaluable for generating protein based tools for 

16 application in basic research, diagnostics, drug 

17 discovery and as protein therapeutics. The ability 

18 to manipulate the primary structure of a protein in 

19 a controlled manner opens up many new possibilities 

20 in the biological and medical sciences. As a 

21 consequence, there is a concerted effort on 
.22 developing methodologies for the site-specific 

23 modification of proteins and their subsequent 

24 application. 
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1 . 

2 The two main approaches to generating proteins are 

3 through recombinant methods or chemical synthesis. 

4 To date, the two methods have proved to be 

5 complementary; recombinant methodologies enable 

6 proteins of any size to be generated but in general 

7 they are restricted to the assembly of the 

8 .. proteinogenic amino acids. Thus, in general, the 

9 introduction of labels and probes into recombinant 

10 proteins has to be implemented post-translationally 

11 and does not allow modifications to the protein 

12 backbone . 
13 

14 The most common methods for labelling a recombinant 

15 protein use an amino or a thiol reactive version of 

16 the label that will covalently react with a lysine 

17 side chain / N** amino group or a cysteine side chain 

18 within the protein respectively. For such labelling 

19 methods to be site-specific, an appropriate 

2 0 derivative of the protein must be engineered to 

21 contain a unique- reactive functionality at the 

22 . position to be modified. This requires all' the other 

23 naturally occurring reactive functionalities within 

24 the primary sequence to be removed through amino 

25 acid mutagenesis. In the case of protein amino 

26 functionalities, this is essentially impossible due 

27 to the abundance of lysine residues and the presence 

28 of the amino functionality at the N-terminus. 

29 Likewise, for cysteine this process is laborious, and 

30 is often detrimental -to the function of the protein. 
31 
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1 ' The production of proteins having site-specific 

2 modifications and/or labels is more readily 

3 achievable using chemical synthesis methods. The 

4 .chemical synthesis of ..proteins, however, enables 

5 multiple modifications to be incorporated into both 

6 side-chain and backbone moieties of the protein in a 

7 site- specif ic manner, but, in general, the maximum 

8 size of sequence that can be synthesised and 

9 isolated is circa 50 - 100 amino acids. 
10 

11 Protein Ligation 

12 A further approach to the generation of proteins is 

13 protein / peptide ligation. In this approach 

14 mutually reactive chemical functionalities 

15 (orthogonal to the chemistry of the naturally 

16 occurring amino acids i.e. which react by mutaually 

17 exclusive chemistries compared to the reactions of 

18 the reactive moieties of the naturally occuring 

19 amino acids) are incorporated at the N- and C- 

20 termini of unprotected polypeptide fragments such 

21 that when they are mixed, they react in a 

22 chemoselective manner to join the two sequences 

23 together (Cotton GJ and Muir TW. Chem.Biol., 1999, 

24 6, R247-R254) . The principle of chemical ligation is 

25 shown schematically in Figure'!. 
26 

27 A number of * chemistries have been utilised for the 

28 ligation of two synthetic peptides where a diverse 

29 range of different chemical functionalities can be 

3 0 incorporated into the termini of polypeptides using 

31 solid phase peptide synthesis. These include the 

32 reaction between a thioacid and bromo- alkyl to 
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1 form a thioester (Schnolzer M and Kent SBH, Science," 

2 1992, 256, 221-225), reaction of an aldehyde with an 

3 N-terminal cysteine or threonine to form 

4 thiazolidine or oxazolidine respectively (Liu C-F 

5 and Tarn J P. Proc. Natl. Acad. Sci. USA, 1994, 91, 

6 6584 - 6588), reaction between a hydrazide and an 

7 aidehyde to form a hydrazone (Gaertner HF et al, et 

8 al Bioconj . Chem., 1992, 3, 262 - 268) reaction of 

9 an aminoxy group and an aldehyde to form an oxime 

10 • (Rose K. J". Am. Chem. Soc, 1994, 116, 30-33), 

11 reaction of azides and aryl phosphines to form an 

12 amide bond (Staudinger ligation) (Nilsson BL, 

13 Kiessling LL, and Raines RT. Org. Lett., 2001, 3, 9- 

14 12, Kiick et al Proc. Natl. Acad. Sci. USA, 2 002, 

15 99, 19-24) , and the reaction of a peptide C- 

16 terminal thioester and an N-terminal cysteine 

17 peptide to form a native amide bond (Dawson et al. 

18 Science, 1994, 266, 776) (Native chemical ligation 

19 US6184344, EP 0832 096 Bl) . This method is an 

2 0 extension of. studies by Wieland and coworkers who 

21 . showed that the reaction of ValSPh and CysOH in 

22 aqueous buffer yielded the dipeptide ValCysOH 

23 (Wieland T et al, . Liebigs Ann. Chem., 1953, 583, 

24 129-149) . 
25 

26 Although the native chemical ligation method has 

27 proved popular, it requires an N-terminal cysteine 

28 and thus, if a cysteine is not present at the 

29 appropriate position in the protein, a cysteine 

30 needs to be introduced at the ligation site. 

31" However, the introduction of extra thiol groups into 

32 a. protein sequence maybe detrimental to its 
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1 structure / function, especially since cysteine has 

2 a propensity to form disulfide bonds which may 

3 disrupt the folding pathway or compromise the 

4 function of the folded protein. 
5 

6 As a consequence of the difficulties and problems 

7 associated with known ligation techniques, the 

8 ligation of two synthetic fragments generally only 

9 enables proteins of circa 100 - 150 amino acids to 

10 be chemically synthesised. Although larger proteins 

11 have been synthesised by ligating together more than 

12 two fragments, this has proved to be technically 

13 dif f icult (Camarero et al . J*. Pept . Res., 1998, 54, 

14 303-316, Canne LE et al, J". Am. Chem. Soc . , 1999, 

15 121, 8720-8727) . 
16 

17 Protein semi -synthesis 
18 

19 protein ligation technologies that enable both 

20 synthetic and recombinantly derived protein 

21 fragments to be joined together have been 

22 described. This enables large proteins to be 

23 constructed from combinations of synthetic and 

24 recombinant fragments allowing proteins to be site- 

25 specifically modified with both natural and 

26 unnatural entities. By utilising such so-called 

27 protein semi - synthesis , many different synthetic 

28 moieties can be site-specifically incorporated at 

29 . multiple different sites within a target protein. 
30 

31 In order to utilise recombinant proteins in ligation 

32 strategies the recombinant fragments must contain 
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1 the appropriate reactive functionalities to 

2 facilitate ligation. One approach to introduce a 

3 unique reactive functionality into a recombinant 

4 protein has been through the periodate oxidation of 

5 N-terminal serine containing sequences. Such 

6 treatment converts the N-terminal serine into a 

7 glyoxyl moiety, which contains ah N-terminal 

8 aldehyde. Synthetic hydrazide containing peptides 

9 have then been ligated to the N- terminus of these 

10 protein in a chemoselective manner through hydrazone 

11 bond formation with the protein N-terminal aldehyde 

12 • group (Gaertner HF et al , et al Bioconj . Chem. , 

13 1992 , 3, 262 - 268, Gaertner HF, et al . J. Biol. 

14 Chem., 1994, 269, 7224-7230). Another approach has 

15 been to generate recombinant .proteins with N- 

16 terminal cysteine residues. Synthetic peptides 

17 containing C- terminal thioesters have then been 

18 site-specifically attached to the N-terminus of 

19 these proteins via amide bond formation in a manner 

20 analogous to * native chemical ligation 1 (Cotton GO* 

21 and Muir TW. Chem. Biol., 2000, 7, 253-261) . However 

22 as with the ligation of synthetic peptides using 

23 native chemical ligation techniques, the technology 

24 requires a cysteine to be introduced at the ligation 

25 site if the primary sequence does not contain one at 

26 the appropriate position. 
27 

28 Protein Splicing Techniques 

29 

30 Recently technologies have been developed which 

31 enable recombinant proteins containing C-terminal 

32 thioester groups to be generated. The C-terminal 
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1 thioester functionality provides a unique reactive 

2 chemical group within the protein that can be 

3 utilised for protein ligation. Recombinant C- 

4 terminal thioester proteins are produced by 

5 manipulating a naturally occurring biological 

6 phenomenon known as protein splicing (Paulus H. Annu 

7 Rev Biochem 2000, 69, 447^496) . Protein splicing is 

8 a post- trans lational process in which a precursor 

9 protein undergoes a series of intramolecular 

10 rearrangements which result in precise removal of an 

11 internal region, referred to as an intein, and 

12 ligation of the two flanking sequences, termed 

13 exteins (Figure 2) . While there are generally no 

14 sequence requirements in either of the exteins, 

15 inteins are characterised by several conserved 

16 sequence motifs and well over a hundred members of 

17 this protein domain family have now been identified. 
18 

19 The first step in protein splicing involves an N— >S 

2 0 (or N-^O) acyl shift in which the N-extein unit is 

21 transferred to the sidechain SH or OH group of a 

22 conserved Cys/Ser/Thr residue, always located at the 

23 immediate N- terminus of the intein. Insights into 

24 this mechanism have led to the design of a number of 
2 5 ■ mutant inteins which can only promote the first step 

26 of protein splicing (Chong et al Gene. 1997, 192, 

27 271-281, (Noren et al . , Angew. Chem. Jnt . Ed. Engl., 

28 2000, 39, 450-466). Proteins expressed as in frame 

2 9 N- terminal fusions to one of these engineered 

3 0 inteins can be cleaved by thiols via an 

31 intermolecular transthioesterif ication reaction, to 

32 generate the recombinant protein C-terminal 



8 



1 thioester derivative (Chong et al Gene. 1997, 192, 

2 271-281, (Noren et al . , Angew. Chem. Int. Ed. Engl., 

3 2000, 39, 450-466) (New England Biolabs Impact System 

4 WO 00/18881, WO 0047751) . Peptide sequences 

5 containing an N- terminal cysteine residue can then 

6 be specifically ligated to the C-tfermini of such 

7 recombinant C- terminal thioester proteins (Muir et 

8 al Proc. Natl. Acad. Sci . USA. , 1998, 95, 6705-6710, 

9 • Evans Jr et al . Prot.- Scl., 1998, 7, 2256-2264) , in 

10 a procedure termed expressed protein ligation (EPL) 

11 or intein-mediated protein ligation (IPL) . The 

12 principle of intein-mediated protein ligation (IPL) 

13 is sillustrated schematically in Figure 3 . As with 

14 the previously described ligation techniques, such 

15 an approach requires a cysteine to be introduced at 

16 the ligation site if one is not suitably positioned 

17 with the primary protein sequence and thus is 

18 subject to the limitations and associated with the 

19 problems of these approaches, such as the potential 
2 0 problems associated with the introduction of an 

21 extra thiol group into the' primary sequence. 
22 

23 The chemoselective ligation of N-terminal cysteine 

24 containing peptides to C -terminal thioester 

25 containing peptides, be they synthetic or 

26 recombinant, is performed typically at slightly 

27 basic pH and in the presence of a thiol cof actor. 
2 8 The strategy also requires a cysteine to be 

2 9 introduced at the ligation site, if one is not 

3 0 suitably positioned within the primary sequence. 

31 These requirements of this ligation approach have 

32 the potential to alter the structure or function of 
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1 both the protein ligation product and the initial 

2 reactants. 
3 

4 Protein labelling 

5 

6 Historically protein ligation means the joining 

7 together of two peptide / protein, fragments but this 

8 is synonymous with protein labelling whereby the 

9 label is a peptide or derivatised peptide. Equally 

10 if a small non-peptidic synthetic molecule contains 

11 the necessary reactive chemical functionality for 

12 protein ligation, then ligation of the synthetic 

13 molecule directly to either the N- or C- termini of 

14 the protein affords site-specific labelling of the 

15 protein. Thus technologies developed for the 

16 ligation of protein fragments can also be used for 

17 the direct labelling of either the N- or C- termini 

18 of peptides or proteins in a site - specific manner 

19 irrespective of their sequence. 
20 

21 Recombinant proteins containing N- terminal glyoxyl 

22 functions (generated through periodate oxidation of 

23 the corresponding N-terminal serine protein) have 

24 been site- specif ic N- terminally labelled through 

25 reaction with hydrazide or aminoxy derivatives of 

26 the label (Geoghegan KF and Stroh JG. Bioconj Chem. , 

27 1992/ 3, 138-146, Alouni S et al. Eur. J. Biochem. , 

28 1995, 227, 328 - 334) . Also recombinant proteins 

2 9 containing N-terminal cysteine residues have been N- 

3 0 terminally labelled through reaction with thioester 

31 containing labels, the label being the acyl 

32 substituent of the thioester (Schuler B and Pannell 
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1 LK. Bioconjug. Chem. , 2002, 13, 1039-43) and 

2 aldehyde (Zhao et al . Bioconj . Chem.,1999, 10, 

3 424-430) functionalities to form amides and 

4 thiazolidines respectively. 
5 

6 Though a number of methods for ligation of proteins 

7' * exist each one has its potential drawbacks. There 

8 is thus a need for novel ligation methodologies, 

9 especially those that are compatible with both 

10 synthetic and recombinant fragments, which will 

11 complement the existing technologies and add another 

12 string to the protein engineers' bow. 
13 

14 Summary of the Invention 
15 

16 The present inventors have overcome a number of 

17 problems associated with the prior art and have 

18 developed a new method for ligating peptide 

19 molecules which overcomes a number of the problems 

20 of the prior art. 
21 

22 Accordingly, in a first aspect of the present 

23 invention, there is provided a method of producing 

24 -an oligopeptide product, the method comprising the 

25 steps: 

26 a) providing a first oligopeptide, the first 

27 oligopeptide having a reactive moiety, 

28 b) providing a second oligopeptide, the second 

29 oligopeptide having a activated ester moiety 

30 c) allowing the reactive moiety of the first 

31 oligopeptide to react with the activated ester 

32 moiety of the second oligopeptide to form an 
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1 oligopeptide product, in which the first and second 

2 ' oligopeptides are linked via a linking moiety having 

3 Formula I, Formula II or Formula III. 
4 

5 Formula I 

O 
II 

— C-NH-NH — 

6 

7 Formula II 

O 
II 

— C-NH-O — 

8 

9 Formula III 

O O 
II II 
— C-NH-NH— C — 

10 
11 
12 

13 In preferred embodiments, in step (c) , where said 

14 oligopeptides are linked via a linking moiety having 

15 Formula II and where said activated ester moiety of 

16 step (b) is not a thioester, said activated ester is 

17 a terminal activated ester moiety. 
18 

19 In further preferred embodiments of the invention, 

2 0 said linking moieties are linked via a linking 

21 moiety having Formula I or Formula III. 
22 

23 Unless the context demands otherwise, the terms 

24 peptide, oligopeptide, polypeptide and protein are 

25 used interchangeably. 
26 
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1 The activated ester moiety of the first oligopeptide 

2 may be any suitable activated ester moiety, such as 

3 a thioester moiety. a phenolic ester moiety, an 

4 hydroxy succinimide moiety, or an O-acylisourea 

5 moiety. 
6 

7 In preferred embodiments of the invention, the 

8 activated ester moiety is a thioester moiety. Any 

9 suitable thioester peptides may be used in the 

10 present invention. In preferred eembodiments , the 

11 thioester is a thioester wherein the peptide is the 

12 acyl substituent of the thioester. 
13 

14 Such thioester peptides may be synthetically or 

15 recombinantly produced. The skilled person is well 

16 aware of methods known in the art for generating 

17 synthetic peptide thioesters . For example, synthetic 

18 peptide thioesters may be produced via synthesis on 

19 a resin that generates a C-terminal thioester upon 

20 HF cleavage (Hojo et al , Bull. Chem. Soc. Jpn. , 

21 1993, 66, 2700-2706). Further, the use of 'safety 

22 catch' linkers has proved to be popular for 

23 generating C-terminal thioesters through thiol 

24 induced resin cleavage of the assembled peptide 

25 (Shin Y et al, J. Am. Chem. Soc, 1999, 121, 11684- 

26 11689) . 
27 

28 Moreover, recently technologies have been developed 

29 which enable recombinant C-terminal thioester 

30 proteins to be generated. Recombinant C-terminal 

31 thioester proteins may be produced by manipulating a 

32 naturally occurring biological phenomenon known as 
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1 protein splicing. As described above, protein 

2 splicing is a post-translational process in which a 

3 precursor protein undergoes a series of 

4 intramolecular rearrangements which result in 

5 precise removal of an internal region, referred to 

6 as an intein, and ligation of the two flanking 

7 sequences, termed exteins . 
8 

9 As described above, a number of mutant inteins which 

10 can only promote the first step of protein splicing 

11 have been designed (Chong et al Gene. 1997, 192, 

12 271-281, Noren et al . , An grew. Chem. Jnt. Ed. Engl., 

13 2000, 39, 450-466) . Proteins expressed as in frame 

14 N-terminal fusions to one of these engineered 

15 inteins can be cleaved by thiols via an 

16 intermolecular transthioesterif ication reaction, to 

17 generate the recombinant protein C-terminal 

18 thioester derivative (Chong et al Gene. 1997, 192, 

19 271-281, Noren et al . , Angew. Chem. Int. Ed. Engl ., 

20 2000, 39, 450-466) (New England Biolabs Impact 

21 .System WO 00/18881, WO 0047751). Such protein 

22 thioesters may be used in the methods of the 

23 invention (See Figure 3) . 
24 

25 Accordingly, in a preferred aspect of the present 

26 invention, in step (b) , the second oligopeptide is 

27 generated by thiol reagent induced cleavage of an 

28 intein. 
29 

30 Accordingly, in a second aspect of the present 

31 invention, there is provided a method of producing 

32 an oligopeptide product, the method comprising the 
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1 steps : 

2 a) providing a first oligopeptide, the first 

3 oligopeptide having a reactive moiety, 

4 b) (i) providing a precursor oligopeptide 

5 molecule, the precursor oligopeptide molecule 

6 comprising a second oligopeptide fused N- terminally 

7 to an intein domain 

8 (ii) allowing thiol reagent dependent cleavage of 

9 the precursor molecule to generate a second 

10 oligopeptide molecule, said second oligopeptide 

11 molecule having a thioester moiety at its C-terminus 

12 c) allowing the reactive moiety of the first 

13 oligopeptide to react with the second oligopeptide 

14 molecule to form an oligopeptide product, in which 

15 the first and second oligopeptides are linked via a 

16 linking moiety having Formula I, II or III. 
17 

18 The reactive moiety of the first oligopeptide may be 

19 any suitable reactive moiety. In preferred 

20 embodiments of the invention, the reactive moiety is 

21 a hydrazine moiety, an amino-oxy moiety or a 

22 hydrazide' moiety having general formula IV, V or VI 

23 respectively. 
24 

25 Formula IV 
— NH-NH 2 

26 
27 
28 

29 - 

30 

31 

32 Formula V 
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— 0-NH 2 

1 

2 Formula VI 
3 

O 
II 

— C-NH-NH 2 

4 

5 For example, in a particular preferred embodiment, 

6 the reactive moiety has Formula IV. and, in the 

7 oligopeptide product produced by the method of the 

8 invention, the first and second oligopeptides are 

9 linked via a linking moiety having Formula I. 
10 

11 In a further preferred embodiment, the reactive 

12 moiety has Formula V and, in the oligopeptide 

13 product produced by the method of the invention, the 

14 first and second oligopeptides are linked via a 

15 linking moiety having Formula II. 
16 

17 In another preferred embodiment, the reactive moiety 

18 has Formula VI and, in the oligopeptide product 

19 produced by the method of the invention, the first 
2 0 and second oligopeptides are linked via a linking 
21 moiety having Formula III. 

22 

23 As described above, the first oligopeptide comprises 

24 a reactive moiety, which, in preferred embodiments, 

25 may be a hydrazine moiety (e.g. Formula IV), an. 

26 . amino-oxy moiety (e.g. Formula V) or an hydrazide 

27 moiety (e.g. Formula VI) . 
28 

29 Hydrazine, hydrazide or aminooxy containing 

30 derivatives of synthetic oligopeptides may be 
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1 readily produced using known methods, for example, 

2 solid phase synthesis techniques. 
3 

4 Further, the present inventors have also found that 

5 proteins fused N-terminal to an intein domain can be 

6 cleaved from the intein by hydrazine treatment in a 

7 selective manner to liberate the desired protein as 

8 its corresponding hydrazide derivative (for example, 

9 see Figure 5) . 
10 

11 Accordingly, in further preferred embodiments of the 

12 invention, the first oligopeptide is generated by 

13 reaction of hydrazine with an oligopeptide molecule 

14 comprising the first oligopeptide fused N-terminal 

15 to an intein domain. 
16 

17 Indeed the discovery that such protein hydrazides 

18 may be produced by means of such a reaction forms an 

19 independent aspect of the present invention. 
20 

21 Accordingly, a third aspect of the invention 

22 provides a method of generating a protein hydrazide, 

23 said method comprising the steps: 

24 (a) providing an protein molecule comprising an 

25 oligopeptide fused N-terminal to an intein domain, 

26 (b) reacting said protein molecule with hydrazine, 

27 such that the intein domain is cleaved from the 

28 oligopeptide to generate a protein hydrazide. 
29 

3 0 Moreover, as well as using such a reaction to 

31 generate a first oligopeptide having a hydrazide 

32 moiety at its C- terminal, the first oligopeptide 
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1 thus being available for reaction with the second 

2 oligopeptide having the activated ester moiety, the 

3 present invention further extends to a 11 one -step" 

4 process for ligating two peptides to generate an 

5 oligopeptide product. 
6 

7 This may be achieved by reacting a suitable protein 

8 linked N- terminal to an intein directly with a 

9 polypeptide having a hydrazine, hydrazide or amino- 
10 oxy moiety. 

11 

12 Accordingly, in a fourth aspect, the invention 

13 provides a method of producing an oligopeptide 

14 product, the method comprising the steps: 

15 a) providing a first oligopeptide, the first 

16 oligopeptide having a reactive moiety, wherein the 

17 reactive moiety is a hydrazine moiety, a hydrazide 

18 moiety or an amino- oxy moiety; 

19 (i) providing a precursor oligopeptide molecule, the 

20 precursor oligopeptide molecule comprising a second 

21 oligopeptide fused N-terminally to an intein domain; 

22 (c) allowing the reactive moiety of the first 

23 oligopeptide to react with the precursor 

24 oligopeptide molecule to form an oligopeptide 

25 product, in which the first and second oligopeptides 

26 are linked via a linking moiety having Formula I, 

27 Formula II or Formula III. 
28 

29 The ligation technology of the present invention can 

3 0 thus utilise both synthetic and recombinant proteins 

31 and peptides. It thus enables the ligation of two or 

32 more synthetic, two or more recombinant or a mixture 
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1 of one or more synthetic with one or more 

2 recombinant peptides. 
3 

4 Moreover, as well as providing a novel method of 

5 ligating peptides, the present invention may be used 

6 for the labelling of synthetic or recombinant 

7 peptides. 
8 

9 Accordingly, in a fifth aspect of the present 

10 invention, there is provided a method of labelling 

11 an oligopeptide, the method comprising the steps: 

12 a) providing a label molecule, the label molecule 

13 having a reactive moiety, 

14 b) providing the oligopeptide, the oligopeptide 

15 having an activated ester moiety 

16 c) allowing the reactive moiety of the label 

17 molecule to react with the activated ester moiety of 

18 the oligopeptide to form the labelled oligopeptide, 

19 in which the label molecule and the oligopeptide are 

20 linked via a linking moiety having Formula I, 

21 Formula II or Formula III as defined above, 
22 

23 In preferred embodiments, in step (c) , where said 

24 label molecule and the oligopeptide are linked via a 

25 linking moiety having Formula II and where said 

26 activated ester moiety of step (b) is not a 

27 thioester, said activated ester is a terminal 
2 8 activated ester moiety. 

29 

30 Alternatively, a label molecule having a terminal 

31 activated ester moiety may be used to label an 

32 oligopeptide having a reactive moiety. Thus, in a 
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1 sixth aspect of the invention, there is provided a 

2 method of labelling an oligopeptide, the method 

3 comprising the steps: 

4 a) providing a label molecule, the label molecule 

5 having an activated ester moiety of which the label 

6 is the acyl substituent, 

7 b) providing the oligopeptide, the oligopeptide 

8 having a reactive moiety 

9 c) allowing the activated .ester moiety of the label 

10 molecule to react with the reactive moiety of the 

11 oligopeptide to form the labelled oligopeptide, in 

12 which the label molecule and the oligopeptide are 

13 linked via a linking moiety having Formula I, 

14 Formula II or Formula III 

15 wherein, in step (c) , where said label molecule 

16 and the oligopeptide are linked via a linking moiety 

17 having Formula II and where said activated ester 

18 moiety of step (b) is not a thioester, said 

19 activated ester is a terminal activated ester 

2 0 moiety. 
21 

22 As with the ligation technology, an oligopeptide 

23 present as a precursor molecule linked to an intein 

24 molecule may be labelled directly. Thus, a seventh 

25 aspect of the present invention provides a method of 

26 labelling an oligopeptide, the method comprising the 

27 steps: 

28 a) providing a label molecule, the label molecule 

29 having a reactive moiety, 

3 0 b) providing a precursor oligopeptide molecule, 

31 the precursor oligopeptide molecule comprising an 

32 oligopeptide fused N- terminally to an intein domain, 
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1 c) allowing the reactive moiety of the label 

2 molecule to react with the precursor oligopeptide 

3 molecule to form a labelled oligopeptide product, in 

4 which the label molecule and the oligopeptide are 

5 linked via a linking moiety having Formula I, 

6 Formula II or Formula III as defined above. 
7 

8 The methods of the invention are particularly useful 

9 in the ligation of peptides, in particular the 

10 ligation of peptides, which, using conventional 

11 ligation techniques, would require various 

12 protecting groups . The inventors have shown that 

13 the methods of the invention may be performed under 

14 pH conditions in which only the reactive moieties 

15 will react. 
16 

17 In preferred embodiments of the first to seventh 

18 aspects of the invention, the method is performed at 

19 a pH in the range pH 4 . 0 to pH 8.5, preferably pH 

20 4.0 to 7.5, more preferably in the range pH 4.5 to 

21 pH 7.0, most preferably in the range pH 5.5 to pH 

22 6.5. 
23 

24 For example, the inventors have demonstrated that 

25 synthetic peptide. C-terminal thioesters specifically 

26 react with hydrazine under aqueous conditions at pH 

27 6.0 to form the corresponding peptide hydrazide. 

2 8 This allows ligation methods as described herein to 

29 be performed at pH 6.0, without the need for a 

30 potentially harmful thiol cof actor (useful if either 

31 fragment or final construct is thiol sensitive) and 

32 does not lead to the introduction of potentially 
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1 reactive side-chain groups (such as a thiol) into 

2 the protein. Similarly, the inventors have 

3 demonstrated that synthetic peptide C-terminal 

4 thioesters specifically react with hydroxylamine 

5 under aqueous conditions at pH 6.0 and pH 6.8 to 

6 form the corresponding peptide hydroxamic acid. 
7 

8 In an analogous fashion, peptides that contain 

9 hydrazine, hydrazide or aminooxy groups can be 

10 reacted: with thioester derivatives of a label or a 

11 peptide to afford site-specific labelling and 

12 chemoselective ligation respectively (see, for 

13 example, figures 4 and 5) . 
14 

15 Furthermore, having demonstrated that recombinant 

16 protein hydrazides can be generated by cleavage of 

17 protein-intein fusions with hydrazine, the inventors 

18 have shown that such protein hydrazides may be 

19 ligated by reaction of the hydrazide moiety with 

20 reactive groups other than activated ester moieties, 

21 for example an aldehyde functionality, a ketone 

22 functionality or an isocyanate functionality. This 

23 aspect of the invention provides a further novel 

24 method of ligating a recombinant peptide to a second 

25 peptide or indeed a label. 
26 

2 7 Thus, an eighth aspect of the invention provides a 

28 method of producing an oligopeptide product, the 

29 method comprising the steps: 

30 • a) providing a first oligopeptide, the the first 

31 oligopeptide having an aldehyde or ketone moiety, 
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1 b) providing a precursor oligopeptide molecule, 

2 the precursor oligopeptide molecule comprising a 

3 second oligopeptide fused N-terminally to an intein 

4 domain, 

5 c) reacting said precursor oligopeptide molecule 

6 with hydrazine to generate an oligopeptide molecule 

7 comprising an intermediate oligopeptide , said 

8 intermediate oligopeptide having a C- terminal 

9 hydrazide moiety, 

10 d) allowing the aldehyde or ketone moiety of the 

11 first oligopeptide to react with the hydrazide 

12 moiety of the intermediate oligopeptide molecule to 

13 form an oligopeptide product, in which first 

14 oligopeptide and the second oligopeptide are linked 

15 via a hydrazone linking moiety. 
16 

17 An example of this aspect is shown in Figure 6. 
18 

19 A ninth aspect of the invention provides a method of 

20 labelling an oligopeptide, the method comprising the 

21 steps: 

22 a) providing a label molecule, the label molecule 

23 having a aldehyde or ketone moiety, 

24 b) providing a precursor oligopeptide molecule, 

25 the precursor oligopeptide molecule comprising a 

26 first oligopeptide fused N-terminally to an intein 

27 domain, 

28 c) reacting said precursor oligopeptide molecule 

29 with hydrazine to generate an oligopeptide molecule 
3 0 comprising an intermediate oligopeptide , said 

31 intermediate oligopeptide having a terminal 

32 hydrazide moiety, 
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1 . d) allowing the aldehyde or ketone moiety of the 

2 label molecule to react with the hydrazide moiety of 

3 the intermediate oligopeptide molecule to form a 

4 labelled oligopeptide product, in which the label 

5 molecule and oligopeptide are linked via a hydrazone 

6 linking moiety. 
7 

8 In preferred embodiments of the eighth and ninth 

9 aspects of the invention, the hydrazone moiety has 

10 Formula VII: 

11 
12 

13 - • 

— N-N=C 

I I 
H R 

14 

15 where R is H or any substituted or unsubstituted, 

16 preferably unsubstituted, alkyl group. 
17 

18 In preferred aspects of the eighth and ninth aspects 

19 of the invention, the method is performed at a pH in 
2 0 the range pH 1.0 to pH 7.0, preferably pH 1 . 0 to pH 

21 6.0, more preferably in the range pH 2.0 to pH 5.5, 

22 most preferably in the range pH 2.0 to pH 4.5. 
23 

24 In a tenth aspect of the present invention, there is 

25 provided an oligopeptide product produced using a 
2 6 method of the invention. 

27 

2 8 In an eleventh aspect, there is provided a labelled 

2 9 oligopeptide comprising an oligopeptide labelled 
30 according to a method of the invention. 
31 
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1 Preferred features of each aspect of the invention 

2 are as for each of the other aspects mutatis 

3 mutandis. 
4 

5 The invention will now be described further in the 

6 following non- limiting examples with reference made 

7 to the accompanying' drawings in which: 
8 

9 Figure 1 illustrates schematically the general 

10 principle of chemical ligation. 
11 

12 Figure 2 illustrates schematically the mechanism of 

13 protein splicing. 
14 

15 Figure 3 illustrates expressed or intein mediated 

16 protein ligation. 
17 

18 Figure 4 illustrates ligation of protein and peptide 

19 thioesters with • hydrazine and aminooxy containing 

20 entities, such as labels, peptides and proteins. 
21 

22 Figure 5 illustrates the generation of synthetic and 

23 recombinant peptide hydrazides for ligation with 

24 thioester containing molecules. Note the peptide or 

25 label is is the acyl substituent of the thioester. 
26 

27 Figure 6 illustrates the generation of recombinant 

28 peptide hydrazides for ligation with aldehyde and . 

29 ketone containing molecules. 
30 

31 Figure 7 illustrates SDS-PAGE analysis of Grb2-SH2 - 

32 GyrA - CBD (immobilised on chitin beads) treated 
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1 with DTT and MESNA. Molecular weight markers (lane 

2 1) ; purified Grb2-SH2 - GyrA - CBD immobilised on 

3 chit in beads (lane 4) . Grb2-SH2 - GyrA - CBD treated 

4 with 100 mM DTT (lanes 5 and 7) or 120 mM MESNA 

5 (lanes 8 and 10) . Both the whole reaction slurries 

6 (lanes 5 and 8) and the reaction supernatants (lanes 

7 7 and 10) were analysed. 
8 

9 Figure 8 illustrates SDS-PAGE analysis of Grb2-SH2 - 

10 GyrA - CBD (immobilised on chitin beads) treated 

11 with hydrazine. Molecular weight markers (lane 1) ; 

12 Purified Grb2-SH2 - GyrA - CBD immobilised on chitin 

13 beads after 2 Oh treatment with phosphate buffer only 

14 (lane 2) . Grb2-SH2 - GyrA - CBD treated with 2 00 mM 

15 hydrazine in phosphate buffer for 2 0 h. The whole 

16 reaction slurries were analysed. 
17 

18 Figure 9 illustrates an ESMS spectrum of the C- 

19 terminal hydrazide derivative of Grb2-SH2. 
20 

21 

22 Examples 

23 

24 Example 1 -Protein ligation / site specific protein 

25 labelling using the reaction of peptide / protein 

26 thioesters with compounds containing hydrazine / 

27 hydrazide or aminoxy functionalities. 
28 

29 Reaction of a peptide C-terminal thioester with 

30 lOOwM hydrazine at pH 6.0 

31 200 mM sodium phosphate buffer pH 6 . 0 containing 

32 lOOmM hydrazine monohydrate (200 pXi) was added to a 
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1 model synthetic peptide a-thioester termed AS626plA 

2 (2 00 |Xg) to yield a final peptide concentration of 

3 317 |1M. AS626plA has sequence ARTKQ TARK(Me) 3 

4 STGGKAPRKQ LATKAARK-COS- (CH 2 ) 2 -C0OC 2 H 5 (SEQ ID NO : 1) 

5 wherein a single Alanine residue (which may be any 

6 one of the Alanine residues of SEQ ID NO: 1) is 

7 substituted by an Arginine residue. The reaction was 

8 incubated at room temperature and monitored with 

9 time by analytical reversed phase HPLC. Vydac C18 

10 column (5 JiM, Q.46 x ) . Linear gradients of 

11 acetonitrile water / 0.1% TFA were used to elute the. 

12 peptides at a flow rate of • 1 mL min" 1 . Individual 

13 peptides eluting from the column were characterised 

14 by electrospray mass spectrometry. 
15 

16 Reaction of a peptide C- terminal thioester with 

17 lOOmM hydroxyl amine at pH 6.0 

18 200 mM- sodium phosphate buffer pH 6.0 containing 

19 lOOmM hydroxylamine hydrogen chloride (2 00 |1L) was 

20 added to AS626plA (200 fig) to yield a final peptide 

21 concentration of 317 |1M. The reaction was incubated 

22 at room temperature and monitored with time by 

23 analytical reversed phase HPLC. Vydac C18 column (5 

24 |1M, 0.46 x ). Linear gradients of acetonitrile water 

25 / 0.1% TFA were used to elute the peptides at a flow 

26 rate of 1 mL min' 1 . Individual peptides eluting from 

27 the column were characterised by electrospray mass 

28 spectrometry. 

29 . . 

3 0 Reaction of a peptide C- terminal thioester with 100 
31 . mM hydroxylamine at pH 6.8 
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1 200 mM sodium phosphate buffer pH 6 . 8 containing 

2 lOOmM hydroxylamine hydrogen chloride (200 |LLL) was 

3 added to AS626plA (200 fig) to yield a final peptide 

4 concentration of 317 JIM. The reaction was incubated 

5 at room temperature and monitored with time by 

6 analytical reversed phase HPLC. Vydac C18 column (5 

7 JIM, 0.46 x ). Linear gradients of acetonitrile water 

8 / 0.1% TFA were used to elute the peptides at a flow 

9 rate of 1 mL min" 1 . Individual peptides eluting from 

10 the column were characterised by electrospray mass 

11 spectrometry. 
12 

13 Reaction of a peptide C- terminal thioester with lOmM 

14 hydroxylamine at pH 6.8 

15 See above procedure. 
16 

17 Reaction of a peptide C-terminal thioester with lOmM 

1 8 hydroxyl amine a t pH 7 . 5 

19 See above procedure. 
20 

21 Reaction of a peptide C-terminal thioester with 2mM 

22 hydroxylamine at pH 7.5 

23 See above procedure. 
24 

25 Results 

26 These examples demonstrate the novel strategy for 

27 protein ligation / site specific protein labelling 
2 8 of both synthetic and recombinant protein sequences 
2 9 of the invention using the reaction of peptide / 

30 protein C-terminal thioesters with compounds 

31 containing hydrazine / hydra zide or aminoxy 

32 functionalities. 
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1 

2 As described above, a purified synthetic 27 amino 

3 acid a-thioester peptide (the ethyl 3- 

4 mercaptopropionate thioester derivative) was treated 

5 with hydrazine and hydroxy 1 amine under various 

6 conditions (Table 1) . 
7 

8 Treatment with 100 mM hydrazine at pH 6.0 formed a 

9 peptide species that eluted earlier than the 

10 starting thioester peptide as analysed by HPLC. This 

11 material was identified as the expected peptide 

12 hydrazide by ESMS : observed mass = 3054 Da, expected 

13 (av. isotope comp) 3053 Da. The reaction of the 

14 peptide C-terminal thioester with hydrazine to form 

15 the peptide hydrazide was monitored with time by 

16 reverse phase HPLC. Only the desired material was 

17 formed with no side product formation even after 3 

18 days- The stability of the peptide hydrazide, under 

19 the reaction conditions, indicates that the reaction 

20 occurs at the C-terminal thioester moiety and is 

21 chemoselective in nature. It also highlights the 

22 applicability of this reaction for protein ligation 

23 and labelling. (2 h 70% conversion , 4h 95% 

24 conversion) 
25 

26 To ascertain whether aminooxy containing compounds 

27 chemoselectively react with peptide / protein C- 

28 terminal thioesters, to afford protein ligation and 

29 site-specific labelling, a synthetic C-terminal 

30 thioester peptide was treated with hydroxylamine 

31 under various conditions (Table 1) . 
32 
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1 A purified synthetic 27 amino acid C-terminal 

2 thioester peptide (ethyl 3 -mercaptopropionate 

3 thioester, observed mass 3155 Da) was incubated at 

4 room temperature with different hydroxylamine 

5 concentrations in aqueous buffers of varying pH. In 

6 all cases the peptide C-terminal thioester reacted 

7 to form a single product that eluted earlier than 

8 the starting thioester peptide as analysed by 

9 reverse phase HPLC. This material corresponds to the 

10 expected hydroxamic acid peptide as determined by 

11 ESMS: observed mass = 3052 Da; expected (av. isotope 

12 comp) 3 054 Da. The kinetics of the reaction were 

13 monitored using reverse phase HPLC. The peptide C- 

14 terminal thioester is converted to the corresponding 

15 peptide hydroxamic acid in a clean fashion with no 

16 side-product formation. As expected increasing the 

17 pH of the reaction buffer accelerates the rate of 

18 reaction. With lOOmM NH 2 OH on moving from pH 6.0 to 

19 pH 6.8 the percentage product formation after lh 

20 increases from 25% to 91%. The rate of reaction with 

21 100 mM NH 2 OH pH 6.0 is comparable with 10 mM NH 2 OH at 

22 pH 6.8. 
23 

24 The rate of reaction of the peptide C-terminal 

25 thioester with hydroxalymine , to form the 

26 corresponding hydroxamic acid, increases with 

27 increasing pH and decreases with decreasing NH 2 OH 

28 concentrations. To identify conditions of pH and 

2 9 reactant concentration suitable for peptide / 

3 0 protein labelling and ligation, the labelling was 

31 performed under increasing pH and decreasing NH 2 0H 

32 c one en t r at i ons . 
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1 

2 The reaction with 10 mM was 83% complete after 4h at 

3 pH 6.8, while at pH 7.5 it was 83% complete after 

4 2h. On further decreasing the NH 2 0H concentration to 

5 2 mM the reaction rate at pH 7.5 decreased markedly, 

6 70% of the starting peptide' a-thioester being 

7 converted to the corresponding hydroxamic acid after 

8 8hrs. It was noted that a small amount of a side- 

9 product corresponding in mass to the peptide acid 

10 was formed during the reaction. Presumably this is 

11 formed by a competing hydrolysis side reaction at pH 

12 7.5, which was not observed with 10 mM NH 2 0H at pH 

13 7.5 due to the faster reaction at this higher 

14 react ant concentration. 
15 

16 



Reactant 


Concent 


pH 


Percentage product formation 




ration 




with 


time 














lhr 


2hr 


4hr 


8hr 


72hr 


NH 2 NH 2 


100 mM 


6.0 




70 


100 






NH 2 0H 


100 mM 


6.0 


25 


48.1 


76 .3 




100 


NH 2 0H 


100 mM 


6.8 


91 


100 








NH 2 OH 


10 mM 


6.8 


26 




83 


100 




NH 2 OH 


10 mM 


7.5 




82.7 


100 


100 




NH 2 OH 


2 mM 


7.5 


11.2 


17 


38 


70 


80* 



17 Table 1 

18 *A11 starting material has reacted with 8 0% 

19 conversion to the desired product and -20% to the 

20 hydrolysis side-product. 
21 

22 Example 2- Generation of recombinant C- terminal 

23 hydrazide proteins through the selective cleavage of 



protein - intein fusions with hydrazine/ and their 
subsequent use in ligation / labelling reactions. 

To investigate (i) the ability to generate 
recombinant C~ terminal hydrazide proteins through 
the selective cleavage of protein - intein fusions 
with hydrazine, and (ii) their subsequent use in 
ligation / labelling reactions, the SH2 domain of 
the adapter protein Grb2 was chosen as a model 
system. 

Sequence of human Grb2 SH2 domain 
HPW FFGKIPRAKA EEMLSKQRHD GAFLIRESES APGDFSLSVK 
FGNDVQHFKV LRDGAGKYFL WVVKFNSLNE LVDYHRSTSV 
SRNQQIFLRD IEQVPQQPT 

Expression of Grb2-SH2 domain - GyrA intein fusion- 

The DNA sequence encoding the SH2 domain of 
human Grb2 appended at its C- terminus with an extra 
glycine residue was cloned into the pTXBl expression 
plasmid (NEB) . This vector pTXBl Gr b2-SH2 <Gi y > encodes 
for a fusion protein whereby the SH2 domain of Grb2 
is linked via a glycine residue to the N-terminus of 
the GyrA intein, which is in turn fused to the N- 
terminus of a chitin binding domain region (CBD) . 
E. coli cells were transformed with this plasmid and 
grown in LB medium to mid log phase and protein 
expression induced for 4h at 37°C with 0.5 mM IPTG. 
After centrifugation the cells were re- suspended in 
lysis buffer (0.1 mM EDTA, 250 tnM NaCl, 5% glycerol, 
1 mM PMSF, 25 mM HEPES, pH 7.4) and lysed by 
sonication. The soluble fraction was loaded onto a 
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1 chitin column pre- equilibrated in lysis buffer. The 

2 column was then washed with wash buffer (1 mM EDTA, 

3 250 mM NaCl, 0.1% Triton-X 100, , 25 mM HEPES, pH 

4 7.0) to yield purified Grb2-SH2 - GyrA-CBD 

5 immobilised on chitin beads (Figure 7) . 
6 

7 Generation of Grb2-SH2 C- terminal thioesters by 

8 thiol induced cleavage of the Grh2-3H2 - GyrA intein . 

9 fusion. 

10 To ascertain that the intein domain within the 

11 protein was functional the fusion protein was 

12 exposed to thiols to assess the extent of cleavage 

13 via transthioesterif ication. Chitin beads containing 

14 immobilised Grb2-SH2 - GyrA-CBD were equilibrated 

15 into 2 00 mM NaCl, 2 00 mM phosphate buffer pH 7.4. 

16 Dithiothreitol (DTT) or 2 -mercaptoethanesulf onic 

17 acid (MESNA) were then added to the beads in 200 mM 

18 NaCl, 200 mM phosphate buffer pH 7.4 to give a 50% 

19 slurry with a final thiol concentration of 100 mM or 
2 0 120 mM respectively. The mixtures were then rocked 

21 at room temperature and aliquot s analysed by SDS- 

22 PAGE . After 48 hours the supernatants from the 

23 reactions were isolated and subsequently analysed by 

24 HPLC and ESMS . 

25 Treatment of Grb2-SH2 - GyrA intein - CBD 

26 fusion with both DTT and MESNA resulted in cleavage 

27 of the fusion protein into two protein species 

28 (Figure 7) . The molecular size of the two fragments 

2 9 corresponds to that of the Grb2 - SH2 and the GyrA - 

3 0 intein fusion, indicative that cleavage has taken 
31 place at the SH2 - intein junction. Cleavage of the 
3 2 precursor fusion protein liberated the SH2 domain 
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1 into the supernatant while the GyrA intein-CBD 

2 portion remained immobilized on the chitin beads. 

3 After cleavage with both DTT or ME SNA, ESMS analysis 

4 of the supernatants confirmed that the Grb2-SH2 was 

5 generated as either the expected DTT or ME SNA C- 

6 terminal thioester derivatives respectively. 

7 Expected mass of Grb2-SH2 DTT - C- terminal 

8 thioester = 12.173.9 Da; observed mass 12173.5 Da. 

9 Expected mass of Grb2-SH2 MESNA - C- terminal 

10 thioester = 12162.0 Da; observed mass 12163.0 Da. 
11 

12 Generation of Grb2-SH2 C- terminal hydrazide by 

13 hydrazine induced cleavage of the Grb2-SH2 - GyrA 

14 intein fusion. 
15 

16 The thioester linkage between Grb2-SH2 and the 

17 GyrA intein in the precursor fusion protein is 

18 expected to be cleaved with hydrazine, the 

19 chemoselective reaction of hydrazine, at the 

20 thioester moiety, liberating Grb2-SH2 domain into 

21 the supernatant as its corresponding C- terminal 

22 hydrazide derivative. Chitin beads containing 

23 immobilised Grb2-SH2 - GyrA-CBD were therefore 

24 equilibrated into 200 mM NaCl, 200 mM phosphate 

25 buffer pH 7 .'4 and hydrazine monohydrate added in the 

26 same buffer to give a 50% slurry with a final 

27 hydrazine concentration of 2 0 0. mM. The mixture was 

28 then rocked at room temperature and analysed by SDS- 

29 PAGE (Figure 8) . After 2 0 hours the supernatant was 

30 removed and analysed by HPLC and ESMS. 

31 Treatment of Grb2-SH2 - GyrA intein - CBD 

32 fusion with hydrazine resulted -in cleavage of the 
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1 fusion protein into two species. The molecular size 

2 of the two fragments as analysed by SDS-PAGE A 

3 corresponded to Grb2 - SH2 and the GyrA - intein 

4 fusion, indicative that cleavage has taken place at 

5 the unique thioester linkage between the SH2 - 

6 intein domains. Cleavage of the precursor fusion 

7 protein liberates the SH2 domain into the 

8 supernatant while the GyrA intein-CBD portion 

9 remained immobilized on the chitin beads. HPLC and 

10 ESMS analysis of the cleavage supernatant confirmed 

11 that a single protein species was generated that 

12 corresponds to the C-terminal hydrazide derivative 

13 of Grb2-SH2. Expected mass of Grb2-SH2 C-terminal 

14 hydrazide = 12051.7 Da; observed mass 12053.0 Da. 

15 (Figure 9) 
16 

17 After 20 h of reaction Grb2-SH2 C-terminal hydrazide 

18 was isolated form the supernatant using RPHPLC and 

19 lyophilised . 
20 

21 Ligation of aldehyde and ketone containing peptides 

22 and labels to recombinant C-terminal hydrazide 

23 containing proteins. 
24 

25 It was anticipated that recombinant protein C- 

26 terminal hydrazides, generated by hydrazine 

27 treatment of the corresponding intein fusion 

28 precursor, can be site-specifically modified by 

29 chemoselective ligation with aldehyde and ketone 

30 containing peptides and labels. To demonstrate such 

31 an approach the ability of a synthetic ketone 

32 containing peptide to ligate with the Grb2-SH2 C- 
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1 terminal hydrazide generated above was investigated. 

2 A synthetic peptide corresponding to the c-myc 

3 epitope sequence was synthesised GEQKLISEEDL-nh 2 . 

4 whereby pyruvic acid was coupled to the amino 

5 terminus of the peptide as the last step of the 

6 assembly. This peptide (designated CH 3 COCO-myc) was 

7 purifed to > 95% purity by RPHPLC and lyophilised 

8 (ESMS expected monoisotopic mass 1328.6 Da; observed 

9 mass 1328.6 Da). 

10 A sample of CH 3 COCO-myc peptide was dissolved 

11 in 100 raM sodium acetate buffer pH 4.5 to give a 4 

12 mM peptide concentration. This peptide solution (10 0 

13 |HL) was then added to an aliquot of lyophilised 

14 Grb2-SH2 C-terminal hydrazide protein (~ 250 and 

15 the reaction monitored by SDS-PAGE (Figure 10) As a 

16 control CH 3 COCO-myc was also incubated with 

17 Cytochrome C, a protein of similar same size to 

18 Grb2-SH2 but absent of a hydrazide functionality. 

19 SDS-PAGE analysis showed that CH 3 COCO-myc 

20 peptide has indeed ligated with Grb2-SH2 C-terminal 

21 hydrazide as indicated by the conversion of Grb2- 

22 SH2 C-terminal hydrazide into a protein species of 

23 a higher molecular weight (approximately 1000-2000 

24 Da higher) . The reaction was virtually complete 

25 after 24 h and the reaction product appeared to be 

26 stable. On the other hand there was no observable 

27 change to Cytochrome C with time i.e no ligation, 

28 establishing that the ligation reaction was 

29 occurring at the C-terminal hydrazide functionality 
3 0 of Grb2-SH2 . 

31 After 96 h of reaction the product from the 

32 Grb2-SH2 ligation reaction was isolated by HPLC and 
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1 characterised by ESMS. Chemoselective ligation of 

2 CH 3 COCO-myc to Grb2-SH2 C-terminal hydrazide via 

3 hydrazone bond formation would give a product of 

4 expected mass 13363.7 Da. The observed product mass 

5 was 13364.1 Da indicating that the desired ligation 

6 product had been formed . 
7 

8 In summary, the present invention provides novel 

9 methods of protein ligation that enable both 

10 synthetic and recombinantly derived protein 

11 fragments to be efficiently joined together in a 

12 regioselective manner. This thus enables large 

13 proteins to be constructed from combinations of 

14 synthetic and recombinant fragments and allows 

15 proteins of any size to be site-specifically 

16 modified in an unprecedented manner. This is of 

17 major importance for biological and biomedical 

18 science and drug discovery when one considers that 

19 the ~ 30,000 human genes yield hundreds of thousands 

20 of different protein species through post- 

21 translational modification. Such post- 
22 translationally modified proteins cannot be accessed 
23 through current recombinant technologies. 

24 

25 The application of such protein ligation techniques 

2 6 may be used for protein based tools, protein 

27 therapeutics and in de novo design and may open up 

2 8 many new avenues in biological and biomedical 

29 sciences that have hitherto not been possible. 

30 

31 All documents referred to in this specification are 

32 .herein incorporated by reference. Various 
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1 modifications and variations to the described 

2 embodiments of the inventions will be apparent to 

3 those skilled in the art without departing from the 

4 scope and spirit of the invention. Although the 

5 invention has been described in connection with 

6 specific preferred embodiments, it should be 

7 understood that the invention as claimed should not 

8 be unduly limited to* such specific embodiments. 

9 Indeed, various modifications of the described modes 

10 of carrying out the invention which are obvious to 

11 those skilled in the art are intended to be covered 

12 by the present invention. 
13 



38 



1 Claims 

2 

3 1. A method of producing- an oligopeptide product, 

4 the method comprising the steps: 

5 a) providing a first oligopeptide, the first 

6 oligopeptide having a reactive moiety, 

7 b) providing a second oligopeptide, the second 

8 oligopeptide having a activated ester moiety 

9 c) allowing the reactive moiety of the first 

10 oligopeptide to react with the activated ester 

11 moiety of the second oligopeptide to form an 

12 oligopeptide product, in which the first and second 

13 oligopeptides are linked via a linking moiety having 

14 Formula I, Formula II or Formula III". 
15 

16 Formula I 

O 
II 

— C-NH-NH — 

17 

18 Formula II 

O 
II 

— C-NH-O — 

19 

20 Formula III 

O O 
II li 
— C-NH-NH-C 

21 
22 
23 

24 2 . The method according to claim 1 wherein the 

25 terminal activated ester moiety is a thioester 

26 wherein the peptide is the acyl substituent- of 
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1 the thioester. 

2 

3 3. The method according to claim 2, wherein said 

4 second polypeptide is generated by thiol reagent 

5 . dependent cleavage of a precursor molecule, said 

6 precursor molecule comprising a second oligopeptide 

7 fused N- terminally to an intein domain. 

8 . 

9 4. A method of producing an oligopeptide product, 

10 the method comprising the steps: 

11 a) providing a first oligopeptide, the first 

12 oligopeptide having a reactive moiety, 

13 (i) providing a precursor oligopeptide molecule, the 

14 precursor oligopeptide molecule comprising a second 

15 oligopeptide fused N.-terminally to an intein domain 

16 (ii) allowing thiol reagent dependent cleavage of 

17 the precursor molecule to generate a second 

18 oligopeptide molecule, said second oligopeptide 

19 molecule having a thioester moiety at its C- 
2 0 terminus, 

21 c) allowing the reactive moiety of the first 

22 oligopeptide to react with the second oligopeptide 

23 molecule to form an oligopeptide product, in which 

24 the first and second oligopeptides are linked via a 

25 linking moiety having Formula I, II or III. 
26 

2 7 5. The method according to any one of the preceding 
2 8 claims wherein the reactive moiety is a hydrazine 

29 moiety, a hydrazide moiety or an aminooxy moiety, 

30 

31 6. The method according to claim 5, wherein .said 

32 first oligopeptide is produced by reaction of 
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1 hydrazine with a precursor molecule, said 

2 precursor molecule comprising a precursor 

3 oligopeptide fused N-terminally to an intein 

4 domain via a thioester moiety. 
5 

6 7. A method of producing an oligopeptide product, 

7 said method comprising the steps: 

8 a) providing a first oligopeptide, the first 

9 oligopeptide having a reactive moiety, wherein 

10 the reactive moiety is a hydrazine moiety, a 

11 hydrazide moiety or an amino-oxy moiety; 

12 (i) providing a precursor oligopeptide molecule, 

13 ' the precursor oligopeptide molecule comprising a 

14 second oligopeptide fused N-terminally to an 

15 intein domain; 

16 (c) allowing the reactive moiety of the first 

17 oligopeptide to react with the precursor 

18 oligopeptide molecule to form an oligopeptide 

19 product, in which the first and second 

2 0 oligopeptides are linked via a linking moiety 

21 having Formula I, Formula II or Formula III. 
22 

23 8. The method according to any one of the preceding 

24 claims, wherein the first oligopeptide or the 

25 second oligopeptide is a recombinant oligopeptide 

26 and the other of the the first oligopeptide and 

27 the second oligopeptide is a synthetic 

2 8 po lypep t i de . 
29 

3 0 9. The method according to any one of claims 1 to 7 , 
31 wherein the first oligopeptide and the second 
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1 oligopeptide are recombinant oligopeptides. 
2 

3 10 . The method according to any one of claims 1 to 

4 7, wherein the first oligopeptide and the second 

5 oligopeptide are synthetic oligopeptides. 
6 

7 11. A method of generating a protein hydrazide, 

8 said method comprising the steps: 

9 (a) providing a protein molecule comprising an 

10 oligopeptide fused N- terminal to an intein 

11 domain, 

12 (b) reacting said protein molecule with 

13 hydrazine, such that the intein domain is cleaved 

14 from the oligopeptide to generate a protein 

15 hydrazide. 
16 

17 12. The method according to any one of the 

18 preceding claims wherein the method is performed 

19 at a pH in the range pH 5.5 to 7.5. 
20 

21 13. A method of producing an oligopeptide product, 

22 the method comprising the steps: 

23 a) providing a first oligopeptide, the the first 

24 oligopeptide having an aldehyde or ketone moiety, 

25 b) providing a precursor oligopeptide molecule, 

26 the precursor oligopeptide molecule comprising a 

27 second oligopeptide fused N-terminally to an 
2 8 intein domain, 

2 9 c) reacting said precursor oligopeptide molecule 

3 0 with hydrazine to generate an oligopeptide 

31 molecule comprising an intermediate oligopeptide 

32 , said intermediate oligopeptide having a 
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1 terminal hydrazide moiety, 

2 d) allowing the aldehyde or ketone moiety of the 

3 . first oligopeptide to react with the hydrazide 

4 moiety of the intermediate oligopeptide molecule 

5 to form an oligopeptide product, in which first 

6 oligopeptide and the second oligopeptide are 

7 linked via a hydrazone linking moiety. 
8 

9 14. An oligopeptide product produced by the method 

10 of any one of the preceding claims. 
11 

12 15. A method of labelling an oligopeptide, the 

13 method comprising the steps: 

14 a) providing a label molecule, the label molecule 

15 having a reactive moiety, 

16 b) providing the oligopeptide, the oligopeptide 

17 having a activated ester moiety 

18 ' c) allowing the reactive moiety of the label 

19 molecule to react with the activated ester moiety 

20 of the oligopeptide to form the labelled 

21 oligopeptide; in which the label molecule and the 

22 oligopeptide are linked via a linking moiety 

23 having Formula I, Formula II or Formula III. 
24 

25 16. A method of labelling an oligopeptide, the 

26 method comprising the steps: 

2 7 a) providing a label molecule, the label molecule 

2 8 having a reactive moiety, 

29 b) providing the oligopeptide, the oligopeptide 

30 having a activated ester moiety 

31 c) allowing the reactive moiety of the label 

32 molecule to react with the activated ester moiety 
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1 of the oligopeptide to form the labelled 

2 oligopeptide, in which the label molecule and the 

3 oligopeptide are linked via a linking moiety 

4 having Formula I, Formula II or Formula III as 

5 defined above.* 
6 

7 17. A method of labelling an oligopeptide, the 

8 method comprising the steps: 

9 a) providing a label molecule, the label molecule 

10 having an activated ester moiety of which the 

11 label is the acyl substituent, 

12 b) providing the oligopeptide, the oligopeptide 

13 having a reactive moiety 

14 c ) allowing the activated ester moiety of the 

15 label molecule to react with the reactive moiety 

16 of the oligopeptide to form the labelled 

17 oligopeptide, in which the label molecule, and the 
.18 oligopeptide are linked via a linking moiety 

19 having Formula I, Formula II or Formula III, 

2 0 wherein, in step (c) , where said label molecule 

21 and the oligopeptide are linked via a linking 

22 moiety having Formula II and where said activated 

23 ester moiety of step (b) is not a thioester, said 

24 activated ester is a terminal activated ester 

25 moiety. 

26 18. A method of labelling an oligopeptide, the 

27 ( method comprising the steps: 

28 a) providing a label molecule, the label molecule 

2 9 having a reactive moiety, 

3 0 b) providing a precursor oligopeptide molecule, 

31 the precursor oligopeptide molecule comprising an 

32 oligopeptide fused N-terminally to an intein 
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1 domain, 

2 c) allowing the reactive moiety of the label 

3 molecule to react with the precursor oligopeptide 

4 molecule to form a labelled oligopeptide product, 

5 in which the label molecule and the oligopeptide 

6 are linked via a linking moiety having Formula I, 

7 Formula II or Formula III as defined above. 
8 

■9 19 . The method according to any one of claims 15 to 

10 18 wherein the method is performed at a pH in the 

11 range pH 5.5 to pH 7.5. 
12 

13 20. A method of method of labelling an 

14 oligopeptide, the method comprising the steps: 

15 a) providing a label molecule, the label molecule 

16 having a aldehyde or ketone moiety, 

17 b) providing a precursor oligopeptide molecule, 

18 the precursor oligopeptide molecule comprising a 

19 first oligopeptide fused N-terminally to an 
2 0 intein domain, 

21 c) reacting said precursor oligopeptide molecule 

22 with hydrazine to generate an oligopeptide 

23 molecule comprising an intermediate oligopeptide, 

24 said intermediate oligopeptide having a terminal 

25 hydrazide moiety, 

26 d) allowing the aldehyde or ketone moiety of the 

27 label molecule to react with the hydrazide moiety 
2 8 of the intermediate oligopeptide molecule to form 
2 9 a labelled oligopeptide product, in which the 

30 label molecule and oligopeptide are linked via a 

31 hydrazone linking moiety. 
32 
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1 21. A labelled oligopeptide produced by the method 

2 of any one of claims 15 to 20. 
3 

4 
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Figure 1 General principle of chemical ligation. 
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Figure 2 Mechanism of protein splicing 
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Figure 3 Expressed or intein mediated protein ligation 
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Figure 4 Ligation of protein and peptide thioesters with hydrazine and 
aminooxy containing entities such as labels, peptides and proteins. 
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Figure 5 Generation of synthetic and recombinant peptide hydrazides 
for ligation with thioester containing molecules 



- 6/10 - 



Recombinant Precursor 
Intein fusion protein 



I | N-Sacyl shift 

II 

o 

linked intermediate 



x v II Recombinant Precursor 

Protein '^^S intein fusion protein: Thioester 



H 3 N^- ( ,ntei " ) 



NH 2 NH 2 



O 

^ Proteln^^^r 



O 

N NHNH 2 

Recombinant peptide / protein hydrazide 



Ligation of Recombinant protein 
hydrazide to aldehyde (R=H) or 
ketone containing entities such 
labels, peptides or proteins 



O 





Figure 6 Generation of recombinant peptide hydrazides for ligation 
with aldehyde and ketone containing molecules 
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Figure 7. SDS-PAGE analysis of Grb2-SH2 - GyrA - CBD (immobilised on chitin 
beads) treated with DTT and MESNA. Molecular weight markers (lane 1); purified Grb2- 
SH2 - GyrA - CBD immobilised on chitin beads (lane 4). Grb2-SH2 - GyrA - CBD 
treated with 100 mM DTT (lanes 5 and 7) or 120 mM MESNA (lanes 8 and 10). Both the 
whole reaction slurries (lanes 5 and 8) and the reaction supernatants (lanes 7 and 10) 
were analysed. 
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Figure 8. SDS-PAGE analysis of Grb2-SH2 - GyrA - CBD (immobilised on chitin 
beads) treated with hydrazine. Molecular weight markers (lane 1); Purified Grb2-SH2 — 
GyrA - CBD immobilised on chitin beads after 20h treatment with phosphate buffer only 
(lane 2). Grb2-SH2 - GyrA - CBD treated with 200 mM hydrazine in phosphate buffer 
for 20 h. The whole reaction slurries were analysed. 
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Figure 9. ESMS spectrum of the C-terminal hydrazide derivative of Grb2-SH2 
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Figure 10. SDS-PAGE analysis of the reaction between synthetic ketone containing 
peptide CH3COCO-myc with Grb2-SH2 - C-terminal hydrazide and Cytochrome C. 
Molecular weight markers (lane 1); Grb2-SH2 - C-terminal DTT thioester (lane 2). 
Reaction between Grb2-SH2 - C-terminal hydrazide and CH3COCO-myc at time points 
t=0 h (lane 3), t=24 h (lane 4), t= 48h (lane 5) and t= 72 h (lanes 6). Reaction between 
Cytochrome C and CH3COCO-myc at time points t=0 h (lane 7), t=24 h (lane 8), t= 48h 
(lane 9) and t= 72 h (lanes 10). 



